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PROCESS FOR THE PURIFICATION AND CRYSTALLIZATION OF PROTEASOME 



The invention concerns a process for isolating a 
purified eukaryotic crystallizable proteasome 
preparation and the proteasome preparation obtainable by 
the process. In addition the invention concerns a 
purified eukaryotic proteasome preparation in a 
crystallized form. With the aid of the crystal data from 
this proteasome preparation it is possible to identify 
and obtain new proteasome inhibitors , especially with 
the aid of computer-aided modelling programs. 

Proteasome is the central enzyme in protein degradation 
in the cytosol as well as in the cell nucleus. It is 
involved in many biological processes including the 
removal of abnormal, misfolded or falsely assembled 
proteins , the reaction to stress (by processing or 
degradation of transcription regulators) , cell cycle 
control (by degradation of cyclines) , cell 
differentiation and metabolic adaptation (by destruction 
of transcription factors or metabolic enzymes) and the 
cellular immune reaction (by generation of antigenic 
peptides that are presented by MHC class I molecules) . 
These cellular functions which are based on a ubiquitin 
and ATP-dependent degradation of proteins require the 
26S proteasome whose nucleus and proteolytic chamber is 
formed by the 2 OS proteasome. 

The 2 OS proteasome from Archaebacterium Thermoplasma 
acidophilum was analysed by X-ray structure 
crystallography at a resolution of 0.34 nm. It has a 
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cylindrical shape with a length of 14.8 nm and a maximum 
and minimum diameter of 11.3 nm and 7.5 nm respectively. 
It is composed of 28 subunits which are arranged in a 
particle as 4 homoheptameric rings a7P737a7 with D7 
symmetry (Lowe et al., (1995), Science 268, 533-539). In 
the T. acidophilum proteasome the N-terminal threonine 
residue of the J3 subunits is the binding site of 
inhibitory peptide aldehydes and is essential for the 
hydrolytic activity. Stock et al. ((1996), Current 
Opinion in Biotechnology, 7: 376-385) also describe the 
structure and function of T. acidophilum proteasomes. The. 
T. acidophilum data cannot be used for eukaryotic 
proteasomes since the homology of the proteasomes 
between these species is too low. 

Eukaryotic proteasomes are considerably more complex 
than the archaebacterial proteasome. Thus the 2 OS 
proteasome from Saccharomyces cerevisiae is composed of 
a total of seven different a type and seven different (J 
type subunits which have already been cloned and 
sequenced cf. e.g. Heinemeyer et al. (1994), 
Biochemistry 33, 12229-12237). 

The eukaryotic 2 OS proteasomes e.g. from yeast and from 
mammals are very closely related with regard to the 
amino acid sequences of subunits and their coarse 
structure recognizable by electron microscopy. The a 
type and p type subunits of the mammalian 20S proteasome 
form an ordered and well-defined structure (Kopp et al. 
(1995), J. Mol. Biol. 248, 264-272). In mammalian cells 
three additional non-essential subunits of the 2 OS 
proteasome which are named LMP2, LMP7 and MECL1 can 
replace constitutive components after induction with the 
cytokine interferon y. Their expression or specific 
deletion changes the peptidase specificity of the 



- 3 - 



proteasome and the expression rate of MHC class I 
molecules on the cell surface. 

Hilt, Heinemeyer and Wolf ((1993), Enzyme Protein 47: 189- 
201) describe the structure of 20S and 26S yeast 
proteasomes and the proteolytic activity of p type 
subunits. In addition the different functions of 20S and 
2 6S proteasomes are discussed with regard to the 
metabolism and differentiation of a cell. Crystal lographic 
data of eukaryotic proteasomes are not described. 

Starting materials that have previously been used to 
purify proteasomes such as tissue and cells of mammals 
such as mouse, rat, human or bovine, other animals, 
plants and yeast are listed in the publication by Rivett 
et al. (1994), Methods Enzymol. 244, 331-350) and the 
citations included therein. 

There are also numerous documents in the patent 
literature which relate to proteasomes. Thus for example 
the production of eukaryotic proteasomes is described in 
EP-A-03 45 750, JP-A-05 292 964 and JP-A-06 022 759. 
However, the proteasome preparations that are disclosed 
do not have sufficient purity to enable crystallization. 

Morimoto et al. ((1995), J. Biochem. 117, 471-474), 
Hwang et al. ((1994), Mol. Cells, vol. 4, 273-275) and 
Perkins et al. ((1994), Journal of Structural Biology 
113, 124-134) describe the crystallization of eukaryotic 
proteasomes. Resolutions of only 0.44 nm, > ca. 5.0 nm 
and 1.5 nm respectively are achieved due to the low 
purity of the proteasome preparations so that a 
structural determination or molecular modelling is not 
possible with these proteasome preparations. 
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Nucleotide and amino acid sequences of proteasome 
subunits are described for example in the Japanese 
applications JP-A-04 077 497, JP-A-04 077 498, JP-A-04 
117 283, JP-A-05 317 059, JP-A-07 255 476, JP-A-08 116 
972, JP-A-08 205 871 and JP-A-08 217 796 and in the 
Japanese Patent 4 0 51 896. 

Proteasome inhibitors are described for example in JP-A- 
05 000 968, WO 92/20 804, WO 94/17 816, WO 95/24 914, WO 
95/25533, WO 96/13 266, WO 96/32 105 (lactacystin 
analogues) and US-A-55 80 854 (peptide aldehyde 
inhibitors) . 

Klafky et al. ((1995), Neuroscience Letters 201, 29-32) 
examine the effect of the proteasome inhibitor calpain 
inhibitor 1 on the secretion of 3 amyloid peptide which 
is formed by cleavage of the p amyloid precursor protein 
(APP) and which has been discussed to be a trigger of 
Alzheimer's disease. There are no proteasome structural 
data in this publication. 

Fenteany et al. ((1995), Science, vol. 268, 726-731) 
describe the streptomyces metabolite lactacystin as a 
cell cycle inhibitor and inducer of neurite outgrowth of 
a mouse neuroblastoma cell line. The 20S proteasome was 
identified by means of tritium-labelled lactacystin as 
the specific cellular target of this inhibitor. A 
crystallizable proteasome preparation is not described. 

WO 91/13904 describes the identification and 
characterization of a chymotrypsin-like protease which 
is present as a multicatalytic protease and its use for 
treating Alzheimer's disease. The use of substrates that 
are specific for chymotrypsin activity to test or screen 
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for inhibitors as described in this document only leads 
to the identification of inhibitors that are specific 
for a chymotrypsin-like activity. 

Hence it is apparent that there is a great need for 
further information about proteasomes especially with 
regard to their exact structure in order to enable the 
preparation of new proteasome inhibitors in a rational 
manner. Thus the object of the invention was to provide 
a process which enables the crystallization of 
eukaryotic proteasome preparations so that the crystal 
structure can be used to simplify the development of new 
inhibitors . 

The inventive object is achieved by a process for isolating 
a purified eukaryotic proteasome preparation comprising the 
steps : 

(a) production of a crude extract by lysing eukaryotic 
cells, 

(b) separation of insoluble components from the crude 
extract, 

(c) chromatographic separation into fractions by means 
of an ion exchange medium, e.g. Q-Sepharose, 

(d) testing the fractions obtained in step (c) and 
collecting the active fractions, 

(e) chromatographic separation over hydroxyapatite, 

(f) testing the fractions obtained in step (e) and 
collecting the active fractions, 

(g) concentrating the pooled fractions, 

(h) chromatographic separation over a gel filtration 
medium in a molecular weight range of 5 kD to 5 MD, 
e.g. Superose and 

(i) testing the fractions obtained in step (h) and 
collecting the active fractions. 
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Any eukaryotic cells can be used as a starting material 
for the process according to the invention e.g. animal 
cells, plant cells or fungal cells such as yeast cells. 
The use of yeast cells e.g. Saccharomyces cerevisiae is 
particularly preferred. 

The fractions are usually tested during the purification 
process by determining the typical proteolytic activity 
for proteasomes. In this connection known chromogenic 
peptides can for example be used as substrates. The 
fractions are preferably tested by carrying out two 
parallel determinations of the proteolytic activity in 
each case, one of which is carried out in the absence 
and the other in the presence of a proteasome inhibitor 
e.g. lactacystin. This type of testing allows the 
fractions containing proteasomes to be unequivocally 
differentiated from other fractions with proteolytic 
activity. 

The concentration process comprises three 
chromatographic separation steps (c) , (e) and (h) of 
which at least one can be carried out in a FPLC system 
e.g. step (h) . 

A purified proteasome preparation is obtained by the 
process according to the invention which is present in 
an adequate amount and purity to enable a subsequent 
crystallization. 

Hence a further subject matter of the present invention 
is a purified eukaryotic proteasome preparation which is 
obtainable by the process according to the invention. 
Yet a further subject matter of the present invention is 
a purified eukaryotic proteasome preparation in a 
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crystallizable form. Yet a further subject matter of the 
present invention is a purified crystallized eukaryotic 
proteasome preparation. 

The crystallized proteasome preparation can also contain 
a proteasome inhibitor. Examples of suitable known 
proteasome inhibitors are lactacystin or analogues 
thereof and tripeptide aldehydes such as calpain 
inhibitor. 

The eukaryotic proteasome preparation according to the 
invention comprises a 2 OS proteasome i.e. a complex of 
28 subunits each of which contains two molecules of 
seven different a type subunits and seven different P 
type subunits. In addition the complex can also contain 
metal ions e.g. magnesium, solvent molecules e.g. water 
and other polypeptide components. 

The purified eukaryotic proteasome preparation according 
to the invention can be used to identify and isolate new 
proteasome inhibitors. Data from the crystal structure 
of crystallized eukaryotic proteasome preparations are 
used in particular for this. The identification and 
isolation of new proteasome inhibitors is preferably 
carried out using a computer-aided modelling program. 

For example the inhibitor can be designed by visually 
inspecting graphic representations of the structure and in 
particular by 

(a) determining the accessible volumes for ligands at 
active sites e.g. with the aid of the programs 
INSIGHT, SYBYL, QUANTA, FRODO, 0 etc., 



(b) determining ideal ligand properties with regard to 
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hydrophobicity or hydrogen bonds e.g. with the aid of 
the programs LUDI, GRID, etc. or/ and 

(c) determining the electronic properties of surfaces 
that are accessible to ligands at the active sites 
e.g. with the aid of the program GRASP etc. 

Alternatively or additionally it is also possible to 
determine ligands by automated ligand fragment docking 
or adaptation procedures eg. with the aid of the 
programs DOCK, LUDI, LEAPFROG etc. 

For this purpose it is particularly preferable to use the 
crystal data for the proteasome subunits of the P type 
in particular for the proteasome subunits (35/PRE2, 
P1/PRE3 or /and 02/PUP1 or homologous subunits from other 
eukaryotic proteasomes and neighbouring subunits thereof 
e.g. p4/Cll or/and p7/PRE4. 

In order to design inhibitors of the human proteasome it 
is possible to modify the inventive crystal structural 
data of the yeast proteasome by homology modelling using 
known amino acid sequences of the human proteasome. Such 
a homology modelling can be carried out by molecular 
graphic programs such as 0, INSIGHT, FRODO, etc.. In 
particular the present invention encompasses a homology 
modelling of the homologous active sites of the active 
monomers in general and in particular for the purpose of 
inhibitor design. The homology of the amino acid 
sequences of the yeast proteasome and of the human 
proteasome in the relevant regions is shown in Figure 1. 

In addition it is intended to illustrate the invention 
by the following examples and figures. 
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Figure 1 




shows the homology between the amino acid 

squences from yeast and humans coding for the 
active subunits of the proteasome; the pl/PRE3, 
P2/PU^, P5/PRE2 subfamilies are indicated by 
the yelxow, green and blue colour respectively; 
the residues of the SI pocket which influence 
the specif id^ty changes of the PRE 3 subfamily 
after substitution of the human subunit Y by 
LMP2 after cytokine induction are shown in 
brown; 



Figure 2 shows the topology of the 28 subunits of the 
2 OS proteasome drawn as spheres, 

Figure 3 shows the C** chain positions of the subunits 

(J7/PRE4, 06/C5, 01 1 /PRE3 , p2 ■ /PUP1 and p3 1 /PUP3 
in which the p-cis and p-trans-p interactions 
by contacts of insertion segments are 
highlighted, 



Figure 4a to b show electron density maps (contoured 

starting at 1 a) in similar orientations 
around THR1 with two F Q -F C coefficients 
after double averaging; the red parts of 
the model were omitted via the phasing. 
P5/PRE2 with the covalently-bound 
* lactacystin (LACT) and the water molecule 
NUK (a) and P7/Pre4 with a part of a 
propeptide (b) , 



Figure 5 shows a scheme of the proposed chemical steps 
of autolyosis and substrate hydrolysis. 
Generation of a processing intermediate by 
hydrolysis on the acidic p ring surface (A) . 
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Generation of the completely processed active 
subunit via an acyl enzyme (B) and its 
hydrolysis (C) . Michaelis complex of a 
substrate polypeptide (D) . Cleavage on the 0 
ring surface and formation of the acyl enzyme 
(E) associated with peptide cleavage. Acyl 
enzyme hydrolysis and release of the 
octapeptide product (F) . 

Figure 6a to c show the binding of the calpain inhibitor 

and the SI pockets, 01/PRE3 is shown in 
grey with the PI contacting residues shown 
in red; P2/PUP1 is shown in green and the 
inhibitor is shown in blue (a); p2/PUPl 
(b) , P5/PRE2 (c) have an analogous colour 
scheme; 

Figure 7 shows the lower half of the p-p chamber. The 
main chain with the red circles for the 
carbonyl oxygens is shown for the C-terminal 
sections of the helices H2 of the seven p type 
subunits which define the p ring area. The 
intermediary processed and the unprocessed 
propeptides of the subunits p6/C5, p7/PRE4, 
P3/PUP1 and P4/C11 (green) and the calpain 
inhibitor (yellow) bound to pl/PRE3 , p2/PUPl 

and P5/PRE2 are shown. Two 
magnesium ions that are located near to the 
P ring area are shown as silver circles; and 

Figure 8 shows a surface view of the proteasome molecule 
cut along the cylinder axis. Three of the six 
calpain inhibitor molecules bound to pl/PRE3, 
P2/PUP1 and P5/PRE2 are shown in red as space 
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filling models. The sealed a openings at the 
two ends of the particle, a few narrow side 
windows and the sharply cut inner 3 ring 
surfaces can be seen. 



Examples 



Example 1 



Protein preparation and characterization 



Yeast cells of Saccharomyces cerevisiae (Hefe-Mayr, 
Munich, Germany) were washed twice with ice-cold water 
and suspended in buffer A (2 0 mM Tris-HCl, pH 7.5, 1 mM 
EDTA, 1 mM NaN 3 ) in a weight ratio of cells to buffer of 
2:3. The cells were disintegrated for 10 min in a 
grinder (Biomatik, Germany) with glass balls (diameter: 
0.5 mm; volume ratio of glass balls to cell suspension: 
3:2). The disruption of the cells was monitored 
microscopically. 

After filtration the crude extract was centrifuged for 
10 min at 10,000 x g in a Sorvall RC 2B centrifuge. The 
supernatant was again centrifuged for 45 min at 134,000 
x g in a Ti-55.2 rotor (Beckmann) . The lipids from the 
upper most layer were carefully removed and the 
remaining yellow solutions were combined. The protein 
concentrations were about 50 mg/ml. 

Immediately after centrif ugation the extract was applied 
to a Q-Sepharose column (5 x 20 cm) which had been 
equilibrated with 280 mM NaCl in buffer A. The column 
was washed with 280 mM NaCl in buffer A, the proteins 
were eluted with a gradient of 280 to 800 mM NaCl. The 
flow rate was 120 ml/h and 12 ml fractions were 
collected. The proteasome was eluted at 400 - 450 mM 
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NaCl. Chymotrypsin-like (CL) , peptidylglutamyl-peptide- 
hydrolase (PGPH) and trypsin-like (TL) enzymatic 
activies were measured in all fractions. 

In order to obtain the 2 OS proteasome the CL activity 
was again measured in all active fractions in the 
presence of lactacystin and the fractions with a reduced 
activity were collected. The combined fractions were 
diluted three-fold with water and applied to a 
hydroxyapatite column (3 x 10 cm) which had been 
equilibrated with 60 mM potassium phosphate, pH 7.5. The 
column was washed with 60 mM potassium phosphate pH 7.5 
and eluted with a gradient of 60-300 mM potassium 
phosphate. The flow rate was 60 ml/h. 12 ml fractions 
were collected. The CL, PGPH and TL activity was 
measured in all fractions and the active fractions were 
combined. 

The combined fractions were concentrated 2 0-fold by 
ultrafiltration using a AMICON YM30 membrane and the 
concentrate was applied to a Superose 6 column (1 x 3 0 cm) 
equilibrated with buffer A. The elution was carried out 
with a flow rate of 18 ml/h in buffer A. The proteasome 
eluted after 37 min. In this manner it was possible to 
obtain 50 mg crystallizable protein from 500 g yeast 
cells . 

All preparative steps with the exception of FPLC were 
carried out at 4°C. The chromogenic peptide substrates 
were dissolved in dimethyl sulphoxide at a concentration 
of 1 mM. The proteolytic activity towards these 
substrates was determined according to Achtstetter et~ 
al. (1994), J. Biol. Chem. 259, 13344-13348. The 
chromogenic peptide substrates were obtained from Bachem 
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(Bubendorf, Switzerland). Q-Sepharose and hydroxy apatite 
were obtained from Sigma and BioRad. The FPLC device, 
the MonoQ and Superose 6 column were obtained from 
Pharmacia (Freiburg, Germany) , all other chemicals were 
obtained in the highest possible purity from Merck 
(Darmstadt, Germany) . 

Example 2 Crystallization 

The crystals were grown in hanging drops at 24 °C. The 
protein concentration which was used for crystallization 
was 40 mg/ml in 10 mM Tris/HCl (pH 7.5) and 1 mM EDTA. 
The drops were composed of 4 /xl of the protein solution 
and 2 /il of a reservoir solution which contained 40 mM 
magnesium acetate, 0.1 M morpholinoethane sulfonic acid 
(pH 6.5) and 12 % 2 , 4-methylpentanediol . The crystals 
containing the inhibitor lactacystin were produced by 
immersion in a 1 mM lactacystin solution for 6 h. The 



crystals containing the inhibitor acetyl-Leu-Leu- 
norleucine (calpain inhibitor I, Boehringer Mannheim) 
were produced by immersion in a 5 mM calpain solution 
for 6 h. The crystal lographic data are shown in Table 1. 



The crystals were very-well ordered and exhibited only a 
slight anisotropy. Thus a resolution of 0.24 nm was 
possible. The acetyl-Leu-Leu-norleucinal-inhibited 
crystals were slightly less ordered. 

The anisotropy of the diffraction was corrected using 
the found structure factor amplitudes with those that 
were calculated from a model with isotropic temperature 
factors using XPLOR (Bruenger, 1992). The data sets were 
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Crystallography 
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obtained with the BW6 beam line at the DESY Hamburg 
using a synchrotron radiation of X = 0.11 nm. The 
crystals were immersed in an antifreeze buffer (30 % 
MPD; 2 8 mM magnesium acetate, 0.1 M morpholinoethane 
sulfonic acid, pH 6.9) and frozen in a stream of 90 °K 
cold nitrogen gas. The diffraction data were collected 
with a 3 00 mm Mar research imaging plate at a distance 
of 275 mm (LACT) or 280 mm (CAL) . The X-ray intensities 
were determined using the MOSFLM computer program 
version 5.3 and the data reduction was carried out with 
CCP4 (Leslie (1992), Acta Cryst. D50, 760-763; Joint 
CCP4 and ESF-EACMB, Newslett. Protein Crystallogr. 
(Daresburg Laboratory Warington UK 26M Collaborative 
Computational Project Number 4 (1994) . 

A rotation function calculated at 0.5 nm resolution 
showed two peaks that were related to the crystal 
symmetry which indicates the presence of local diadic 
molecule axes at v|/ 86° 4> 90° and \j/ 94° <f> 90°. Their 
correlation values were half the value of the 
crystallographic diad that would have been expected for 
an almost ideal molecular two-fold symmetry. The T. 
acidophilum model was used for the Patterson search 
calculation using AMoRe (Navaza (1994), Acta Cryst. A50, 
157-163) at a resolution of 0.35 nm. This showed that if 
one takes into consideration the D7 symmetry of the 
experimental model, there is a single solution with a 
correlation value of 0.32 and an R factor of 56 % 
compared to the next-highest peak of 0.28 and 57 %. 

The T. acidophilum model was reduced to polyalanine with 
only a few conserved residues which remained in the a 
type subunit. This model yielded an R factor of 57.7 % 
and was used to calculate a 2F Q -F C map at 0.24 nm with 
X-PLOR (Bruenger (1992), X-PLOR version 3.1). A system 
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for X-ray crystallography and NMR was used. The 
electronic density was averaged in real space using MAIN 
(Turk (1992), Dissertation, Technical University Munich) 
and using the local double axis in the present model 
(H/=85.1, (|>=90.8, k=180.1), transformed back, and a new 
density was calculated with 2F Q -F C coefficients. After 
10 adjusting cycles the quality of the map was good 
(Rback" 27 * 3 % ) • The individual subunits were identified 
on the basis of their characteristic insertions, 
deletions and amino acid sequences and were incorporated 
into the map on an ESV-30 graphic system work station 
(Evans & Sutherland, Salt Lake City, Utah) using FRODO 
(Jones (1978), J. Appl. Cryst. 11, 268-272). A 
crystallographic refinement was carried out with X-PLOR 
(Bruenger, 1992) with energetic and double non- 
crystallographic symmetry constraints using the 
parameters described by Engh and Huber (1991), Acta 
Cryst. A 47, 392 - 400. In addition a scattering 
component for the solvent was calculated in order to 
correct for an anisotropic crystal arrangement and was 
incorporated into the calculation of the model during 
the refinement. 

The final model has taken into account the inhibitor 
molecules lactacystin and acetyl-Leu-Leu-norleucinal, 18 
magnesium ions and 1,800 water molecules. The R values 
are satisfactory and the standard geometry of the bonds 
and angles are excellent. The local molecular diadic 
symmetry is well-conserved which is also shown by the 
very low value R hac ^ 13 % in the final step of the 
analysis. The increase in the R value by 3 % for data 
with a resolution of 0.28 nm compared to 0.2 4 nm is a 
result of the anisotropic crystal arrangement which 
impairs the data quality and of the limited 
incorporation of ordered solvent molecules. 



- 16 - 



Example 4 Characterization of the structure 

4.1 Structure of subunits 

The 14 genes cloned from yeast which code for components 
of the 2 OS proteasome can be divided into seven a type 
and seven P type subunits. 

The p type subunits are synthesized as precursors which 
are processed into the mature forms present in the 
assembled proteasome. The mature P type polypeptides 
P2/PUP1, P5/PRE2 and P1/PRE3 are obtained from their 
proforms by cleavage between Gly-1 and Thrl with release 
of the active site Thrl, whereas p7/PRE4 is cleaved 
between Asn-9 and Thr-8 and P6/C5 is cleaved between 
His-10 and Gln-9 and are present as stable processing 
intermediates. p4/Cll and P3/PUP3 are not processed and 
begin with Met(-l) or Met (-9) respectively. The subunits 
PUP1, PRE 2 and PRE 3 are referred to as completely 
processed, the subunits PRE4 and C5 as partially 
processed and the subunits Cll and PUP 3 as unprocessed. 

All 14 subunits are present in the crystalline molecular 
structure at well-defined positions. They are almost 
completely defined by the electron density apart from a 
few chain termini and long insertion segments. 

The electron density for the main chains is defined as 
follows in the a type subunits: a2/Y7: Thr5-Leu236, 
(X3/Y13: Gly4-Gly237, 0C4/PRE6: Tyr8-Gln244, a5/PUP2: 
ArglO- Glu243 (7 residues of the insertion are not 
defined - Glyl2 to Arg 126), a6/PRE5: Phe4 - Ile233, 
(X7/C1: Gly5 - Asn241, al/C7: Gly6 - Asp240. 



In the p type subunits the electron density is defined 
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as follows: p3/PUP3: Ser-8-Asp 193, 36/C5: Gln-9-Asp 
193, p4/Cll: Met-1-Glnl92, P7/PRE4: Thr-8-Ile211, 
32/PUPl: Thrl-Cys221, pl/PRE3 : Thrl-Leul96, p5/PRE2: 
Thrl-Gly211. 

All seven a and P type polypeptides have a 
characteristic P sandwich structure. It is composed of 
two five-strand antiparallel p-folded sheet structures 
with the overlying helical layers composed of the 
helices H3, H4, H5 and the underlying helices HI and H2. 
However, they differ in the bends which vary in length 
by one or two amino acid residues, in long insertions 
which connect secondary structural elements and in the 
N-terminal and in particular in the C-terminal regions. 

In the a type subunits a2/Y7 has a long insertion loop 
between the strands S9 and S10 which is composed of a 
short a helix and a p strand. al/C7 has an extension of 
the helix H3 by two bends as a result of the insertion 
at G180. The subunits al/C7, a3/Y13, a4/PRE6, CC5/PUP2 
and on /CI have longer C-terminal helices H5 which 
protrude into the solution from the particle surface. 
The highly charged, mostly acidic C-terminal segments 
are unstructured. 

In the case of the p type subunits with long insertions, 
P7/PRE4 has a sharp bend between the helices HI and H2 
and an additional a helix with 2 bends at residue 145. p 
6/C5 has an insertion of 17 residues between H3 and H4 
with a complex folding and a short helix. P2/PUP1 has a 
very long C-terminal extension whose last 11 residues 
are very disordered. The subunits p3/PUP3 and P6/C5 have 
short C-termini so that the helices H5 do not exist and 
the strands S10 are extended to enlarge the P-folded 
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sheet. The helix H5 exists in p4/Cll but it is two folds 
shorter than in the case of T. acidophilum. 

Many of these subunit-specif ic folds, insertions and N- 
and C-termini are involved in the contacts between 
subunits as discussed in the following. 

4.2 The (C7, Y7 . Y13 , PRE 6 . PUP 2 . PRES. CI: PRE 3 . 

PUP1. PUP 3 , Cll, PRE2. C5. PRE4^ n complex 

Each of the seven a type subunits has two neighbours 
within the heptameric ring which exhibit a-cis 
interactions and one or two neighbouring P type subunits 
in another ring with a-trans-P interactions In addition 
to the p-cis and P-trans-a interactions, the central P 
type subunits have one or two neighbouring p type 
subunits in the other p ring with p-trans-P 
interactions . 

The general architecture of the quarternary structure is 
the same in the proteasome of T. acidophilum and yeast 
(Fig. 2) : The N-terminal loop segment, helix HO 
(residues 20 to 30) , loop L, the loop connecting H2 and 
S5 and the strand S7 mediate a-cis interactions. The p- 
cis contacts which appear to be less close include the 
loop L, the N-end of the helix HI, the strand S7 and the 
bend linking the strand SB and the helix H3 . These 
contacts are derived from the D7 symmetrical precursor 
and are also found in the T. acidophilum proteasome. 
Despite the conserved architecture these contacts are 
specific for the respective subunits due to their 
specific amino acid sequences. 

Their are many additional contacts which are absent in 
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T. acidophilum and result from sequences and sequence 
insertions which are generally characteristic for yeast 
and eukaryotes. Within the a rings, close a-cis contacts 
are made by the intertwined N-termini of the subunits 
ctl/C7, a2/Y7, (X3/Y13 and <x7/Cl in the centre of the 
heptameric ring. Tyr8 which is conserved in all subunits 
plays a central role. Within the P rings there is a very 
specific contact between p2/PUPl and (3 3 /PUP 3 which is 
mediated by the long C-terminal arm of PUP1 which 
includes PUP 3 and almost touches the next but one 
neighbour p4/Cll. p-trans-a contacts are made by the 
helix HI loop helix H2 motifs which interact with the 
same motifs of two neighbouring subunits. This basic 
contact motif was also seen in the T. acidophilum 
structure (see Figure 4a by Lowe et al. (1995), Science 
268, 3479-3486) but the insertion at residue 66 of 
P7/PRE4 favours its association with (X6/PRE5 and a7/Cl. 
In the same manner the long insertion in a2/Y7 at 
residue 210 between strands S9 and S10 binds to p2/PUPl 
and couples this pair. Specific P-trans-P interactions 
are formed by the C-terminal arm of P7/PRE4 which is 
intercalated between p2 , /PtJPl and P1VPRE3. The C- 
terminal segment of p5/PRE2 interacts with P3'/PUP3 and 
p4 f / C11 in a similar manner (Figure 3). The long 
insertion of p6/C5 at residue 145 contacts subunit 
p3VPUP3 and the C-terminal arm of p2 f /PUPl, 

Very specific P-trans-p interactions are mediated by 
magesium ions: magnesium Y8 bridges the main chain 
carboxylate of Aspl93 from p6/C5 with the loop 162 to 
167 of P2 , /PUP1. i n the same manner the magnesium Y9 
bridges the subunits p3/PUP3 via Aspl93 with p5*/PRE3. 
In addition these carboxylate groups are ligands for 
other magnesium ions which are located in the loops 165 
of P4/PUP3 (magnesium W6) and P6/C5 (magnesium W4) and 
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can play a role in stabilizing the subunit structure. 
The aspartate residues are completely covered and their 
side chains participate in charge-charge interactions 
with Arg 19 of p2 ■ /PUP1 and Arg 19 of P5'/PRE2 which 
further strengthens the p-trans-p contacts. The p type 
subunits 01/PRE3 and p4/Cll are located at the single 
molecule diads of the yeast proteasome and are very 
similar to the dominant p-trans-P contact the residues 
133-137 of the helix H3 of T. acidophilum. 

18 magnesium positions were identified in the proteasome 
molecule of which 12 are located on the inner walls of 
the p-P chamber and which prove the acidic nature of 
this compartment which is discussed in the following. It 
can be seen that the numerous specific interactions 
between the subunits determine their specific and 
unequivocal positions within the proteasome. 

4.3 The N-terminal threonine position 

A catalytic system with Thrl, Glul7 and Lys33 was 
defined in the* T. acidophilum proteasome by structural 
and mutation investigations (Lowe et al. (1995), supra 
and Seemiiller et al. (1995), Science 268, 579-582). 

The residues Serl29, Serl69 and Aspl66 are located close 
to Thrl which are required for the structural integrity 
of this position but could also be involved in the 
catalysis. It was shown by mutagenesis that Aspl66 in 
the proteasome of T. acidophilum is essential (Seemiiller 
et al. (1996), Nature 382, 468-470). These residues are 
invariant in the active subunits PUP1, PRE2 and PRE3 . 

In addition a completely bound solvent molecule NUK was 
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found in all three subunits close to ThrlO 7 and N, 
Serl290 7 and N and Gly47N as shown as an example for the 
subunit /S5/PRE2 in the lactacystin complex (Figure 4) . 
This was not recognized in a lower resolution in the 
model of T . acidophilum . ThrlN has hydrogen bridges to 
Serl680 and 0 7 and Serl290 7 . ThrlO 7 has a hydrogen bridge 
to Lys33 f . Aspl7 has hydrogen bridges via 0 61 to Argl9N 
and Glyl70N and via O 62 to Thr/Ser2N and Lys3 3N f . In a 
similar manner Lys3 3N f has three hydrogen bridges to 
Aspl70 62 , Argl90 and ThrlO 7 . 

The pattern of hydrogen bridges leads one to assume that 
Aspl7 as well as Lys3 3 are charged. ThrlN can form a 
hydrogen bridge to ThrO 7 and is presumably neutral , a 
state which is favoured by a nearby positively charged 
lysine residue. Such a charge distribution would also be 
expected from the respective standard pKa values. ThrlN 
is therefore very probably the proton acceptor, if 
ThrlO 7 is part of an electrophilic centre . This is 
confirmed by the structure of the lactacystin complex 
which has an ester between lactacystin and Thrl as a 
result of a j8 lactone ring opening after a nucleophilic 
attack by ThrlO 7 . ThrlN is at exactly the position to 
serve as a proton shuttle from ThrlO 7 to 
lactacystin-06 ' . An analogous reaction sequence is 
proposed for the hydrolysis of the C-terminal 
f luorophores of f luorogenic substrates where the proton 
transfer takes place in the amide nitrogen of the 
leaving group. The generated acyl enzyme is deacylated 
by the water NUK as shown in the sections D-E of figure 
5. Alternatively or concurrently NUK could directly 
attack the peptide bond thus circumventing the 
intermediate I . 
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4.4 Inhibitor binding 

03/PUP1, 01/PRE3 and P5/PRE2 have the inhibitor acetyl- 
Leu-Leu-norleucinal bound covalently to ThrlO? presumably 
as a hemiacetal. It assumes a P conformation and fills 
the gap between strands which contain the residues 20 and 
21 and 47 (allocated to the loop L in figure 3 by Lowe et 
al., 1995, supra) to which it is bound via hydrogen 
bridges which generates an antiparallel 3 folded sheet 
structure. The norleucine side chain extends into a 
pocket (the SI pocket) the side of which is open towards 
a tunnel which leads to the particle surface. The leucine 
side chain in P2 is not in contact with protein and the 
leucine side chain in P3 is in contact with the 
neighbouring (J-subunits. The SI specificity pocket is mainly 
formed by the residues 20, 31, 35, 49, 53 i.e. Ala20, 
Val31, Ile35, Met45, Ala49, Gln53 (K) in P5/PRE2 (fig. 
6c), Thr20, Thr31, Thr35, Arg45, Ala49, Gln53 in (31/PRE3 
(fig. 6a), Ser20, Cys31, His35, Gly45, Ala49, Glu53 in (32 
PUP1 (fig. 6b). The residue 45 forms the bottom of the 
pocket and appears to largely determine its character. 
Neighbouring subunits in the P rings also contribute to 
the SI pockets and modulate their character: p2/PUPl in 
the case of pl/PRE3 with Hisll4, Hisll6, Serll8, Aspl20; 
P3/PUP3 in the case of P2/PUP1 with the residues Aspll4, 
Aspl20 and Cisll8 and P6/C5 in the case of p5/PRE2 with 
Serll8, Aspll4, Glul20 and Glul22. 

Lactacystin is covalently bound to p5/PRE2. This is in 
agreement with the observed chemical modification of 
subunit X of the mammalian proteasome (Fenteany et al., 
(1995), Science 268, 726-730) the homologue of PRE2 . Its 
dimethyl side chain at CIO extends into SI like a valine 
or leucine side chain but less deeply than the norleucine 
side chain of calpain. Lactacystin forms several hydrogen 
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bridges with atoms of the protein main chain LactN- 
Gly470, Lact04 f -Gly47N, Lact09 ' -Thr21N, Lact06 • -ThrlN. 
Since these latter interactions can also occur in (32/PUP1 
and 01/PRE3 which form no covalent complexes with 
lactacystin, the SI side group which binds in the 
hydrophobic SI pocket of 05/PRE2 appears to direct the 
formation of a covalent bond and its stabilization. Thus 
this side group is an important starting point for the 
development of inhibitors. 

4.5 Specificity 

P5/PRE2 has a methionine residue at position 45 which is 
in contact with the branched side chain of lactacystin in 
the complex. In the calpain-inhibitor complex the 
norleucine side chain of calpain pushes the methionine 
side chain by up to 0.27 nm towards Ile35 which rotates 
out of the way. This concerted movement makes the SI 
pocket more spacious. This is compatible with the 
observation that lactacystin inhibits the chymotryptic 
activity towards chromogenic substrates. In a similar 
manner the chymotryptic activity is reduced in 
proteasomes with a (35/PRE2 mutant which cannot be 
processed from their proform (Chen & Hochstrasser (1996) , 
Cell 86 961-972) and by a mutation in 05/PRE2 where a 
substitution of Ala49 by Val in the SI pocket limits the 
size (Heinemeyer et al. (1993), J. Biol. Chem. 268, 5115- 
5120) . pl/PRE3 has an arginine residue in position 45 at 
the bottom of the SI pocket which is well suited for 
glutamate PI residues. It is most probably the subunit 
that is associated with the peptidylglutamyl-peptide 
hydrolysis activity (PGPH) of the proteasome. However, 
the norleucine side chain also occupies this basic pocket 
in the calpain inhibitor complex, A high additional 
density peak was observed which is associated with the 
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guanidinium side chain and can be interpreted as a 
cMoride or carbonate ion which compensates a non- 
equalized positive charge. 02/PUP1 has a glycine as 
residue 45 and consequently a spacious SI pocket, the 
bottom of which is bordered by His35 and Glu53. 

We conclude that 05/PRE2 contains the chymotryptic as 
well as the tryptic activity whereas pl/PRE3 contains the 
PGPH activity but that both pockets are adaptable with 
regard to size (PRE2) and polarity (PRE3) • 02/PUP1 is 
suitable for very large PI residues with a basic 
character. Mutation analyses have shown that 
substitutions in P4/C11 and 07/PRE4 influence the 
chymotrypsin-like and the PGPH activity (Heinemeyer et 
al., (1993) supra; Hilt & Wolf (1996), TIBS 21, 96-102, 
Hilt et al. (1993), J. Biol. Chem. 268, 3479-3486). These 
subunits are inactive but are located in the vicinity of 
the subunits J35/PRE2 and (31/PRE3 from both rings (figure 
4). The substitution of Serl36 by the voluminous Phe in 
P4/C11 interferes with the p-trans-p contact on helix H3 
between P4/C11 and p5/PRE2 and can interfere with the 
neighbouring Thrl position, as presumably also the 
deletion of 15 C-terminal residues of p7/PRE4, which form 
extensive contacts with pl/PRE3 (fig. 3) . 

4.6 Propeptides and processing 

Five P type subunits are synthesized using propeptides of 
different lengths of up to 75 amino acids which are 
cleaved during maturation. P2/PUP1, p5/PRE2 and P1/PRE3 
exhibit an autolysis between Gly-1 - Thrl. This is a 
process which requires the presence of Thrl, Gly-1 and 
Lys33. We had already suggested an autolysis within the 
subunit in which ThrlO? as a nucleophile attacks the 
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preceding peptide bond (Schmidtke et al. (1996) , EMBOJ. 
15, 6887-6898). 

According to the crystal structure the water NUK is 
allocated a central role. It is ideally positioned to act 
as a base for the removal of a proton of ThrlO? and to 
promote nucleophilic addition to the carbonyl carbon of 
Gly-1. There is no information about the position and 
orientation of the Gly-l-Thrl peptide group in the 
completely processed subunits but we can derive them from 
partially processed or unprocessed subunits 03/PUP3, 
(36/C5 and 07/PRE4 which have similar orientations. In 
these subunits Gly-10 is directed towards the positively 
charged Lys33N^ and of Gly47N which form an oxygen anion 
hole in analogy to serine proteases in order to 
distribute the negative charge that is formed when the 
tetrahedral adduct is formed. A rearrangement to form the 
ester can take place after proton transfer from the water 
NUK to ThrlN and cleavage of the peptide bond. The nearby 
residues Serl290Y and Serl690? support this reaction. Both 
hydroxyl groups are bound to Aspl66 which is invariant in 
the active subunits via hydrogen bridges. NUK probably 
also participates in the ester hydrolysis as an attacking 
nucleophile which is ultimately incorporated into the 
product (fig. 5, sections a to c) . The Gly-1 residue 
appears to be essential since a side chain at position -1 
would interfere with the protein backbone at position 168 
and would force a configuration which is unsuitable for 
autolysis. 

When Thrl is released the subunits become active. If the 
catalytic site is not intact as in the subunits 03/PUP3, 
P6/C5 and 04/011 in which Thrl is absent, in 07/PRE4 in 
which Lys3 3 is replaced by Arg and in constructed 
variants of LMP2, the mammalian homologue of 01/PRE3 
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(Schmidtke et al. (1996) , supra) and of PRE2 (Chen & 
Hochstrasser (1996), supra) an autolysis at residue 1 
does not occur. 07/PRE4 has both essential residues Gly-1 
and Thrl but in a configuration which differs strongly 
from that in which the active subunits are found since 
the Thrl side chain is pushed away by the larger Arg33 
which replaces the lysine residue (figure 4b) . The finding 
of defects in the catalytic activity and in the 
processing demonstrate the structural lability of the 
Thrl site which can be impaired by mutations of 
neighbouring residues of the same or neighbouring 
subunits. On the other hand it is also possible that an 
inactive mutant can become active in the vicinity of 
active subunits which is in accord with observations that 
T.acidophilum species which have a defect in processing 
are processed when coexpressed with wild-type protein 
(Seemttller et al. (1996), supra). 



The propeptides play an essential role in the assembly of 
eukaryotic proteasomes which may be due to direct or 
indirect effects by participation in interactions between 
subunits and/or by stabilizing the structure of subunits. 
The observed structures of the processing intermediates 
of 07/PRE4(M) and 06/C5 and of the unprocessed propeptide 
03 /PUP 3 indicate that both effects occur since the 
propeptides are firmly bound to the residue of the 
protein and interact with other subunits e.g. propeptide 
07/PRE4 with 01/PRE3 and residues 92 and 115 and 
propeptide 06/05 with 07/PRE4 at 91 and 116. 

4-iJZ Entry i nto and exit from the oroteasome particle 

The hydrolytic activity of the proteasome is associated 
with Thrl and the 0 ring surfaces in the interior of the 
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P cavity that defines the hydrolytic chamber. The 
substrate must penetrate into the particle and the 
product must W released. In the case of the proteasome 
from T.acidoplidlum two entry openings with a diameter of 
about 1.3 nm atfe open at the ends of the cylindrical 
particles which We bordered by a ring surface of bend- 
forming segmentsVryrl2 6-Gly-Gly-Val of seven identical a 
subunits. The N-t&rminal residues 1 to 12 are disordered 
in this protein. \ 



In contrast the hydrolytic chamber of the 2 OS proteasome 
of yeast is almost inaccessible. The N-termini of the 
subunits al/C7, a2/Y7, a3/Y13, (X6/PRE5 and (X7/C1 
protrude into the opening and fill it completely with 
several layers of closely interwoven sidechains (figure 
8) . Hence there is no access to the interior of the 
particle from the cylinder ends without a considerable 
rearrangement. There some narrow side windows in 
particular at the interface between the a and P rings 
which are more permeable than in the T. acidophilum 
proteasome since smaller side chains are present there. 
These openings are mainly located between the tooth-like 
helix Hl-bend-helix H2 motifs of the a-P interface (see 
figure 4a by Lowe et al. (1995), supra) and lead to the 
N-terminal threonine residues of the active centre. They 
are covered by polar and charged amino acid side chains 
which can move in order to generate openings of about 
1 nm diameter and possibly allow the passage of unfolded 
stretched polypeptide chains. The 19S particle which is 
responsible for the ATP dependency and ubiquitin 
dependency of proteolysis by the proteasome is attached 
to the particles to form the 26S proteasome. The 
association leads to a strong activation of peptide 
hydrolysis (Hoffman and Rechsteiner (1994), J. Biol. 
Chem. 269, 1690-1695). The proteasome regulator PA28 is 
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bound in a similar manner to a type subunits (Kania et 
al. (1996), Euro. J. Biochem. 236, 510-516). It 
accelerates peptide cleavage and improves the antigen 
processing. Both regulatory factors could open the entry 
openings in a controlled manner in vivo. 

4.8 Generation of MHC class I peptides 

The 2 OS proteasome generates peptide products with a 
narrow length distribution, mainly octapeptide or 
nonapeptides, a size range which is optimal for binding 
MHC class I molecules (York & Rock (1996) , Annu. Rev. 
Immunol. 14, 369-396). In vitro experiments have shown 
that peptides generated from intact proteins by 2 OS 
proteasomes are presented by MHC class I molecules (Dick 
et al. (1994) Immunol. 152, 3884-3894; Niedermann et al. 
(1996), Proc. Natl. Acad. Sci. USA 93, 8572-8577). In an 
in vivo experiment it was shown that proteasome 
inhibitors inhibit the MHC class I presentation of 
protein antigens (Rock et al. (1994), Cell 78, 761-771) 
and that the number of the MHC class I molecules present 
on the cell surface is regulated by the inducible 
proteasome subunits p5i/LMP7 and PH/LMP2 as shown in 
mice with site-directed deletions of the genes coding for 
these proteins (Fehling et al. (1994), Science 265, 1234- 
1237) . LMP2 and LMP7 replace the constitutively expressed 
subunits after IFN-y stimulation. 

MHC class I peptides usually have basic or hydrophobic C- 
terminal residues (see the review article by Engelhard 
(1994), Curr. Opin. Immunol. 6, 13-23). The LMP2/7 
substitution presumably changes the distribution of 
peptides so that a larger proportion of the peptides 
preferred by MHC class I molecules is generated. LMP2 
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replaces Y the human homologue of (51/PRE3, LMP7 replaces 
X the homologue of P5/PRE2 . All members of this subfamily 
exhibit a high degree of sequence identity but 01i/LMP2 
has two conspicuous differences compared to 01/PRE3 in 
the SI pocket: Thr31 -> Phe and Arg45 — > Leu. The 
substitution of Arg by Leu makes the pocket unpolar and 
the substitution of Thr by Phe makes it narrower so that 
the PGPH activity should be reduced and the chymotryptic 
activity should be increased if pli/LMP2 replaces the 
mammalian homologue for 01/PRE3. This is actually 
observed (Gaczynska et al. (1993), Nature 365, 264-267; 
Driscoll et al. (1993), Nature 365, 262-264) when LMPs 
are induced by treatment with IFN-y. The opposite effect 
is found in cell lines which lack the LMP2 and LMP7 genes 
and in mutant mice with a disruption of the LMP2 gene 
(Van Kaer et al. (1994), Immunity 1, 533-541). 
Replacement of the mammalian homologues of 05/PRE2 and 
(32/PUP1 by LMP7 and MECL1 does not directly influence the 
SI pockets and their effect cannot be due to the change 
in the specificity in PI as is found for LMP2. 

In the yeast 2 OS proteasome the subunits 07/PRE4 and 
p6/C5 are partially processed at residues -8 and -9. As a 
result octapeptide or nonapeptide products are formed 
which are not released from the enzyme. Both peptides 
have similar conformations with a thickening which 
subdivides two sections with extended conformations which 
is similar to the conformation of MHC class I-bound 
peptides. The similarity is quantified with rms 
deviations for all atoms of 0.23 nm and for the C a atoms 
of 0.13 nm by comparing the propeptide of (56/C5 with a 
viral peptide nonamer in a complex with its MHC class I 
receptor (Madden et al. (1992), Nature 321-325) and 



- 30 - 

allows the conclusion that preferred local conformations 
play a role in the generation (by the proteasome) and 
presentation (by MHC class I molecules) of immune- 
dominant peptide epitopes. 



